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Abstract 
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1 Introduction 


A rooted graph is a pair {H, v) where is a graph and v € V (H) is a distinguished 
vertex called the root. We often use only the symbol H to denote {H, v); in this case 
we write root(iJ) = v. For a graph G and its vertex v, let Sr be the function that 
maps (G, v) to the the rooted graph {H, v) where H is the subgraph induced on the 
vertices of G with distance from v at most r. We simplify Br{G,v) = Br{{G,v)) 
for Br and other functions on rooted graphs. 

A graph is locally finite if the degree of each of its vertex is finite. Denote by 
= the isomorphism relation between connected rooted graphs which preserves the 
root. Let (C/*, dioc) be the space of rooted connected locally finite graphs with 
equivalence relation = and distance 

dioc{Gi,G2) = 

Consider a sequence of finite graphs {Gn,n = l,2,...}. In this paper we assume 
|f^(G„)| > 1 for n > 1. Let v* = r;* be a uniformly random vertex from V(Gn). 
The component of Gn containing v* together with root v* induces a Borel measure 
fJ-n on {G*,dioc) for each n. Let fi* be another Borel measure on {G*,dioc), and 
denote by G* a random element^ with law p,*. Following mil [311132], we say that 
G* is the local weak limit of {G„} and write G„ —>■ G* if and only if the measures 
pin converge weakly to /i*: for each continuous bounded function /:!/*—>■]& 

E/(G„,^;*)^E/(G*). (1) 

Here and below all limits are as n —>■ oo, unless stated otherwise. Since {G*,dioc) is 
separable and complete [I], a standard argument (e.g., Corollary 1 of 0) shows that 
Gn —>■ G* if and only if for each non-negative integer r and each rooted connected 
graph H 

¥iBriGn,v*) P(H,(G*) ^ H). 

We focus on models of random graphs with bounded average degree. Among others, 
the inhomogeneous random graph model of Bollobas, Janson and Riordan [TO] and 
the preferential attachment model, see Berger, Borgs, Chayes and Saberi |6] have 
been shown to have a weak limit (in an explicit form). The local weak limit, 
if it exists, yields a lot of information about the asymptotics of various graph 
parameters, see, e.g., [I Ha EDI El El]. 

The present contribution consists of a general result, relating the asymptotics 
of subgraph counts with the weak limit, and its application in the area of random 
intersection graphs. This covers results of several previous papers, see Section |4l 

^Without loss of generality we assume that all random objects we define in the paper, 
are random elements in a single probability space (D, P) with the specified laws; the 
integration E is over fl. 
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2 Weak local limit and subgraph counts 

In Section 7 of [20] Bollobas, Janson and Riordan remark that the local weak 
limit does not always determine the global subgraph count asymptotics, see also 
Example 12.21 below. They propose an extra condition of “exponentially bounded 
tree counts”. We show that a simple condition on the degree moment is sufficient; 
and, in general, necessary. 

A homomorphism from a graph H to a graph G is a mapping from V{H) 
to V{G) that maps adjacent vertices in H to adjacent vertices in G. Denote by 
emb(i7, G) the number of embeddings (injective homomorphisms) from H to G. 
For a rooted graph H' let emb^(iJ', G, u) denote the number of embeddings from 
H' to G that map r{H') to v. Let 77.(77) denote the set of all \ V{H)\ possible rooted 
graphs obtained from a graph H. Finally let dc{v) denote the degree of vertex v 
in G. 

Theorem 2.1 Let h > 2 be an integer, let {G„,n = 1,2...} he a sequence of 
graphs, such that ni = ni(n) = |F(G„)| —>■ oo and Gn^G*. Write d* = dc*{r*), 
where r* = root(G*), and assume E((i*)^“^ < oo. Denote by dn the degree of a 
uniformly random vertex in Gn- Then the following statements are equivalent: 

(i) ^E(d*)'‘-i; 

(ii) dlf~^ is uniformly integrable; 

(Hi) for any connected graph H on h vertices and any H' G 'R-iH) 

emh{H, Gn) ^ E emb'(i7', G*,r*). 

The above theorem provides a sufficient condition for the continuous but not 
necessarily bounded function fn ■ {G*,dioc) R defined by (G, v) = emb'{H, G, v) 
to satisfy (|T|) . It is easy to construct weakly convergent sequences for which ©-([ml) 
fail to hold: 

Example 2.2 Let Gn be as in Theorem, \2.1\ and assume \V{Gn)\ = n. Let G}^ be 
obtained by merging edges of a clique on a subset Sn of Gn- If |<S'ra| = and 

l^nl = o(n) then G'„^G*, but (Oj) - fml) do not hold for G'^. 

The proof of the theorem is given in Section jSj It follows from the next basic but 
not widely known result of Sidorenko [37]. 

Theorem 2.3 (Sidorenko, 1994) Let H be a connected graph on h vertices. Then 
for any graph G 


hom{H,G) <hoT[i{Ki^h-i,G) = ^ do{v)'^ 

vGV{G) 
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Here hom(H, G) is the number of homomorphisms from H to G and Ki s is the 
complete bipartite graph with part sizes 1 and s. A special case for H path has 
been rediscovered in , see also [53] . 

Below, in Section jj] we restate Theorem O in a random setting and demon¬ 
strate how it implies general results on network statistics expressible through sub¬ 
graph counts such as the clustering and assortativity coefficients. 

3 Uncorrelated random clique trees 

Random intersection graphs were introduced in |33j and received some attention 
as a potential model for large empirical networks with clustering; see, e.g., surveys 
[ildS]. We show that in the regime which yields sparse graphs with a positive 
clustering coefficient in such models the weak limit is very specific, namely it is a 
uncorrelated random clique tree, defined formally below. 

Let H = be a bipartite graph. The intersection graph G = G{H) 

of H is the graph on the vertex set V(G) = with edges 

E(G) = {uv :3 wGV'^ such that uw,wv G H}, 

where e G is a shorthand for e G E{H). An intersection graph of a random 
bipartite graph H is called a random intersection graph. It will be convenient 
to assume that R® consists of the first Ui elements of a countable set V*, where 
n = 0. The set is often called the set of attribute.^. We will call the 
elements of V* vertices of type i. For v € V’' we denote Sy = Th{v), Xy = |S'„| 
where Th{x) is the set of neighbours of v in the graph El. Sometimes we will want 
to stress the type of v in the notation. Since V'' = ...} consists of the 

first Ui vertices of Vi, for v = we will set Xy"^ := Xy and Sy"^ := Sy. We will 
denote by A ^ V the fact that X and Y have the same distribution. 

Many different variants of the random bipartite graph El have been studied, 
see, e.g., survey papers [Hug: 

• the active random intersection graph: each v & independently chooses 
Xy^^ from a distribution P on { 0 ,..., m}, then draws a uniformly random 
subset Sf of size Xy^^ of its neighbours from (independently of other 
vertices). A special case is the binomial random intersection graph; 

• the passive random intersection graph: each v G independently chooses 

/o') 

Xy from a distribution P on {0,... ,m}, then draws a uniformly random 
subset Sy of size Xy^'' of its neighbours from (independently of other- 
vertices) . 

• the inhomogeneous random intersection graph Q’^^h-omog ^ 77 , 2 , ^ ^( 2 )). 

vertices r; G V* are independently assigned random non-negative weights 

^The names V and W are often used in the literature for and V^. 
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^ Given the weights, edges vw appear in H independently with 

probability niin(^^^^, 1). 

We will also consider random intersection graphs ^ 2 ) based on the 

configuration model, see [29l [38]. Let di = {dij,j = 1,..., m} and ^2 = {d 2 k, k = 
1,..., 712 } be sequences of non-negative integers such that dij = The 

corresponding random bipartite multigraph ^ 2 ) with parts {V ^of 

sizes TT-i and n 2 is obtained as follows. Distribute the total number of 2 ^ dij 
half edges among the vertices of U so that j-th vertex of part i receives dij 
half-edges. Pick a uniformly random perfect matching between the half-edges of 
parts and V'^. In the bipartite graph, add an edge between u and v whenever 
a half-edge from u is matched with a half-edge from v (we allow multiedges). 

Usually, see, e.g., the above models yield random graphs with a linear 
number of edges and a clustering coefficient bounded away from zero only if ^ = 
0(1). Therefore we will assume ^ = 0(1) in this paper. 

Let /i be the distribution of a random variable Z on [0, 00 ) with 0 < EZ < 00 . 
We denote by Z* a random variable with the size-biased distributior^ 

= (EZ)-i [ tdfj.{t) 

Ja 

for any Borel set A. If Z is integer valued, then ]P{Z* = k) = (E Z)~^kF{Z = k). 

Given two random variables Di,D 2 on {0,1,2,...} with E Di,E D 2 € (0,oo) 
define a multi-type Galton-Watson process T = T(Di, D 2 ) as follows. S{0) consists 
of a single root node r = root(T). The root r has a set S'(l) of offspring, where 
|S'(1)| ^ £> 1 . For each fc > 1, S{k -\- 1) consists of the offspring of the nodes in 
S{k). Given |5'(/c)|, the number of offspring of each node in S{k) is independent 
and distributed as — 1- Here i{k) = 2 if fc is odd and i{k) = 1 otherwise. We 
call S'(fc), the set of vertices at distance k from the root, the generation k oiT- A 
corresponding random tree, also denoted by T, is a graph on the vertex set yJkS{k) 
with edges {uv : u is an offspring of u\ and root r. Gonsider 7” as a bipartite graph 
with parts (Ui,V 2 ), where Vi and V 2 consists of all nodes in generations 0 , 2 ,... 
and 1,3,... respectively. We define the uncorrelated random clique tree Gj- to be 
the intersection graph of T rooted at r. 

For a finite (random) sequence A, we write X Gy A to denote the fact that 
X is chosen uniformly at random from all the elements of A (given A). For ran¬ 
dom variables Z, Zi, Z 2 , ■ ■ ■ we denote by Z the fact that converges in 

distribution to Z. 

Let iL be a rooted connected graph. For a (multi-)grapl|l G of size ni > 1 

^We follow the star-notation of other authors, see, e.g., mm- We also use symbols 
such as G* ,d*, v* to denote objects unrelated to size-biased random variables, the actual 
meaning should be clear from the context. 

"^For multigraphs we define Gr — G 2 if and only if there are bijections (j>i : U(Gr) -A 
V{G 2 ) and 02 : E{G\) -A E{G 2 ) such that 0r maps the endpoints of e to the endpoints 
of 02(e) for each edge e € Gi, and 0i(root(Gr)) = 02(root(G2)). 
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denote Pr{G, H) = ni^\{v G G : Br{G, v) = H}\, and set Pr{G, iJ) = 0 if ni = 0. 
Let {Gn, n = 1,2,...} be sequence of random graphs and let G* be random graph 


on (t/*, dioc)- We write G„ A G* as n —>■ oo if for each non-negative integer r and 
each finite H G G* 


Pr{Gn,H)AF{Br{G*)=H). 


( 2 ) 


Theorem 3.1 Let {G„} be a sequence of random intersection graphs where the 
underlying bipartite graphs are Hn = {V^,V'^, F) with = V^{n) 

and F = F{n). For z = 1, 2 let v* Gu 1^*, n-i = ni(n) = |T*| and = X^'‘\n) = 

Suppose {nil, { 77 , 2 } are sequences of positive integers, such that ni,n 2 00 , 
n^jnx —>■ /3 G (0, 00 ) and 

(i) either Gn, n = 1, 2,... is an active random intersection graph and there is a 
random variable Di with EHi G (0,oo) such that ^ EHi and 

X^^^ A Dp, (3) 


(ii) or Gn, n = 1,2,... is a passive random intersection graph and there is a 
random variable D 2 with E £)2 G (0,oo) such that EX*^^) —>■ EZI 2 and 


X(2) 


D 


D2 


(4) 


(Hi) or Gn = G®”^°’"°®(ni, 772, n = 1, 2,..., such that for i = 1,2 0 < 

< 00 and does not depend on n; 

(iv) or Gn = {di,d 2 ), n = 1 , 2 ,..., where di = di{n) and d 2 = d 2 {n) are 

random sequences of length m and 772 respectively. Let mj be the number 
of elements in di that equal j and let N = Suppose for 

7 = 1,2 there is a random variable Di with KDi G (0, 00 ) sueh that 


N 


m 


A EA 


^ A P(A 

77 * 


j) for j = 1,2,... 


Then both (0) and ^ hold and Gn A Gj- with T = T{Di, D2). 


The proof is given in Appendix 

Recall that given a non-negative random variable X, a mixed Poisson random 
variable with parameter X attains value k with probability E 
fc = 0,1,_We denote this distribution by Po{X). 


Remark 3.2 (see also U(A \llf ) In the case (i) we have D 2 ~ Po{l3 ^EHi), 
in the ease (ii) we have Di ^ Po(/?EH 2 ) and in the case (Hi) we have Di ^ 
~ (i)-(w) j3¥,D2 = ¥.Di. 
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Remark 3.3 For arbitrary random variables D'^, D '2 on {0,1,2,...} with positive 
means, there is a sequence of random configuration intersection graphs as in (iv) 
for which Di = D[, D 2 = D^. 

Thus in the active, the passive and the inhomogeneous model either Di, or D 2 , 
or both, has a (mixed) Poisson distribution. The configuration model generalises 
these models in terms of local weak limits. For example, both Di and D 2 can be 
power-law. 

The fact that T{Di, D 2 ) is a limit for many sparse bipartite graph sequences 
is intuitive and in some physics literature has been assumed implicitly [341135] . A 
result similar to Theorem IQ (iv) can be found in [^[30]. For completeness, we 
provide our own, more formal proof in the Apendix. We do not use the second 
moment condition and work under slightly weaker assumptions (convergence in 
probability). We are not aware of literature on weak limits in cases (i)-(iii). 


4 Applications 

The proofs of the results in this section are given in Section |6l Let {Gn} be 
a sequence of random graphs with \V{Gn)\ > 1, and G* a random element on 
iG*,dioc)- Assume \V{Gn)\ > 1 for all n and let v* = w* be a chosen uniformly at 
random from V{Gn) (given G„). 

4.1 Subgraph counts in random graphs 

To apply Theorem [2T] in a random setting we need some easy technical facts. They 
follow by a simple application of the Levy-Prokhorov metric, although we do not 
use it here. 

Lemma 4.1 Suppose Gn A- G* and {G„} are defined on the same probability spac^. 
Then there is a random set A of positive integers such that 

a) P(n £ A) —>■ 1 as n —>■ 00 and 

b) |A| = 00 and Gn —^ G* as n ^ 00 , n £ A almost surely. 

Lemma 4.2 G„ A G* if and only if for each bounded continuous function f : 
ig,,dioc)^T^we /iai;eE(/(G„,z;*)|G„) AE/(G*). 

We now restate Theorem [Q for sequences of random graphs. 

Lemma 4.3 Let h >2 be an integer, suppose Gn A G* and assume inf \V{Gn)\ —>■ 
00 . As before, denote d* = dc*ir*), r* = root(G*) and assume E(d*)^“^ < 00 . 
Denote by dn the degree of a uniformly random vertex in Gn- Then the following 
statements are equivalent: 

®Here this assumption is essential; in the rest of the paper it only affects the notation. 
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(i) 

(a) is uniformly integrable; 

(Hi) for any connected graph H on h vertices and any H' S 'R-{H) 

emb{H, Gn) ^ E emb'{H',G*,r*). 

Each of the above statements implies that for any connected graph H on h vertices 
and any H' G Ti-iH) 


nf^emh{H,Gn) ^Eemb'{H',G*,r*). (5) 


4.2 General weakly convergent sequences 

In this section we assume that Gn A G*, ni = ni(n) = |k(G„)| > 3 is non-random 
and ni —>■ oo. As before, is the degree of v* = v*(n) in G„ and d* is the degree 
of the root r* of G*. Lemma Hd] yields convergence of emb(iL, G„) provided 
that {\V{H)\ — l)th degree moment of G„ converges. This allows us to determine 
the limit behaviour of statistics based on subgraph counts. 

The clustering coefficient of a graph G is defined as 


a(G) := 


emb(Ar3, G) 

emb(P 3 ,G) ’ 


where K 3 is the clique on 3 vertices and Pt is the path on t vertices. (Set a(G) := 0 
when the denominator is zero.) For a rooted graph H' let hom'(iJ', G, u) denote 
the number of homomorphisms from H' to G that map r{H') to v. Let K'^ be K 3 
rooted at any vertex. For t > 2, let K'^ be the bipartite graph ATi.t rooted at the 
vertex of degree t. 

Corollary 4.4 Suppose Gn^G* andEd^ —>■ E(d*)^ £ (0,oo). Then 

, * Eemh'iK'„G*,r*) ^ E emW{K^,G*,r*) 

“ ■ Eemh'{K[2,GGr*) E(d*)2 


The assortativity coefficienf see e.g. [111130], for a graph G is defined as Pear¬ 
son’s correlation of the degrees over the neighbouring vertices 


r(G) := 


g(G) - b(G)^ 

6'(G)-6(G)2’ 


where 


g(G) := (2e(G)) ^'^dG{u)dGiv); 
b'{G) := (2e(G))-i^dG(u)2; 
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6 (G) := (2e(G))-i^dG(w); 
e(G) := |F;(G)| 






and the sums are over all 2e(G) ordered pairs ( m , v) of adjacent vertices in G. We 
define r(G) := 0 when either e(G) or b'(G) — b(G)^ is zero (i.e., G is regular). The 
above quantities can be easily expressed in terms of subgraph count statistics, see, 
e.g., [12] and Section [51 Denote by P 4 the graph F 4 rooted at one of its internal 
vertices. 


Corollary 4.5 Suppose Gn A G*, —?> E (d*)^ <00 and Var{d*) > 0. Then 


Er(G„) Ap* 


Ed*E hom'(P.^,G*,r*) - (E {d*ff 
Ed*E(d*)3 - (E (d*)2)2 


( 6 ) 


Corollaries 14.41 and 14.51 easily follow from Lemma 14.11 see Section m Notice that 
since a(G),r(G) £ [0,1], convergence in probability in these corollaries implies 
convergence of means. 

Statistics that can be expressed in terms of integrals of bounded functions, 
such as the limit degree distribution, are obtained directly from the local weak 
limit. Hence no degree moment conditions are necessary. Let 7 rfc(G) be the fraction 
of vertices of degree k in G. By Lemma 


7rfc(G„) =E(Irf^^(,.)=,.|G„) AP(d* = fc). (7) 

Given a graph G and an integer fc > 2, let be a uniformly random 

triple of distinct vertices from V{G). The eonditional clustering coefficient is 


afe(G) := £ Glvlvlv^v* £ G,d(u*) = fc), 

and set ak{G) := 0 if the event in the condition has probability zero. Lemma Td. 2 1 
implies that if P(d* = k) > Q then 


OikiGn) cx*k — 


EId.=fc emb'(i£^,G*,r*) 


( 8 ) 


k{k - l)P(d* = k) ■ 

The conditional assortativity, see m is defined as 

rfe(G) :=E(dGK*)IA^2* € G,dK) = fc), 

and set rk{G) := 0 if the event in the condition has probability zero. Let P 3 be P 3 
rooted at one of the endpoints. 

Corollary 4.6 Suppose Gn A G*, Ed^ —?> E (d*)^ <00 and P(d* = fc) > 0. Then 


^k{Gn) A — 


EId.=fehom'(Pj,G*,r*) emb'(Pj, G*, r*) 


fcP(d* = k) 


= 1 + 


/cP(d* = k) 


In a similar way we can study the bivariate degree distribution m and many 
other functionals. 


9 






4.3 The case of random intersection graphs 

Here we apply the above general results in the case where the limit is the uncor¬ 
related clique tree of Section |31 We stress that Theorem 12.11 and its corollaries 
are applicable for a much broader class of sequences; including the inhomogeneous 
sparse random graph and the preferential attachment model liiTi]. 

Theorem 13.II yields the first main condition (convergence to a local weak limit) 
for Lemma l4.ll For the other condition (convergence of a degree moment) we prove 


Lemma 4.7 Let {Gn} be a sequence as in Theorem \3.1\ and let k be a positive in¬ 
teger. Suppose an additional condition for each of the cases (i)-(iv) of Theorem \3.1\ 
holds: 

(i) E(X(i))'= < oo; 

(a) E(x(2))fc+1 ^ ED^+i < oo; 

(Hi) E (^^^^)^ < oo and E (^^^^)^^^ < oo; 

(iv) ED( <oo, ED^+i < oo and there is a function /n —^ 0 such that 

rii 

PdEDr-n-i^d^l >/„) = o(n^2fc) * = 1^2, m = 1,..., fc + i - 1. 
i=i 

Then E (d*)^ < 00 and Ed^ —^ E (d*)^. 

A special case (i), fc < 2 was shown in m- Here we use a different argument based 
on Theorem 13.11 see Section | 6 l 

Using the same notation as in Section 021 assume that Gn A G* ^ Gj- where 
T = T{Di, D2), EDi > 0 and EII 2 > 0 . Let ^ 1 ,^ 2 , • • • ~ — 1 be independent 

and independent of Di. By d?]) we have 7 rfc(G„) A P(d* = k) = P(X]fc\ A = 
k). For sequences of graphs as in Theorem I3.1f il. (ii), (iii) the corresponding 
convergence of means has been shown in mm. see also Remark O We also 
notice that the second moment condition required in m for the inhomogeneous 
model is not necessary. 

By simple calculations we get 

= E(Zi)fe=E(U 2 )fc+i(EU 2 )-i fc=l, 2 ,...; 

lil JJ 2 

Ed* = E{Zi+--- + Zd^) = EDiEZi; 

E{d*f = EDiEZf+E{Di)2{EZif; 

E{d*f = EDiEZ^ + 3E {Di)2 EZiEZ(+E{Di)3{EZif; 

E emb'(A: 3 , G*,r*) =EDiE{Zi)2 =EDi{EZ(-EZi) 
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and 


E horn {PiG*,r*) = EDiEZf+E(Di) 2 EZiEZl + {d*fE Z^. (9) 

IE Di 

(The above estimates hold also in the case when either side is infinite.) 

When EDf < oo and E Df < oo, we have 

^ E Z^iE Z? 2 lE (£^ 2)3 

“ ^ ED 1 ED 2 E { 02 ) 3 +E{Di)2iE ( 02 ) 2 )^' 

in Corollary 1131 Using R.emark [3.21 this simplifies to a* = E Di/E Df for active and 
to a* = E (£> 2 ) 3 (]E (.^ 2)3 + /3(E ( 152 ) 2 )”^ for passive random intersection graphs. 
This is equal to a related estimate a = limE emb(if 3 , G„)(E emb(P 3 , G„))“^ ob¬ 
tained by Bloznelis m and the estimates of Godehardt, Jaworski and Rybarczyk 
m for these particular models. 

Similarly, if E Df < 00 and E < 00 then p* in Corollary 14.51 is a rational 
function of E£>i, EDf and ED 2 , k = 1,2, 3,4 obtained by using (ID) and the above 
expressions for E{d*y in (|6|). One can check by simple algebra that p* is equal 
to p = lim(Eg(G„) — E6(G„)^)(E&'(G„) — E6(G„)^) computed in [TS] for sparse 
passive and active random intersection graphs. 

Assuming only that P(d* = fc) > 0 we get in (IH) 


a 


* 

k 


E(j2fliZyz,-y\d* = k) 
k{k-l) 


( 10 ) 


If D 2 ^ Po(A) as is the case, e.g., for the active random intersection graph of 
Theorem EH i), then as in |16) (but without a second moment assumption) 


, AP(d* = fc - 1) 
kV{d* = k) 


( 11 ) 


Finally, if EOj < 00 , EiA| < 00 in Corollary 14.61 we have 


/D, 

rl = k-^E 

\i=i 

This agrees with a related estimate obtained for active and passive random inter¬ 
section graphs in m- 

Thus Corollaries 14.41 - 14.61 generalise several previous results for particular ran¬ 
dom intersection graph models to arbitrary sequences of graphs with the uncor¬ 
related clique tree as a limit. Applying them together with Lemma 14.71 with an 
appropriate k yields slightly stronger versions (i.e., convergence in probability and 
optimal moment conditions) of these results for the active and passive random 
intersection graphs with bounded expected degree. We are not aware of similar 
results for the inhomogeneous and configuration models. 


d* =k\ + 


E (iAi)2E ( 1 ) 2)2 
E DiE D 2 
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5 Proof of Theorem 12.1 


Recall that a sequence of random variables {Xn, n = 1,2,...} is uniformly inte- 
grable if sup„_,.^ sup„ E |X„|Ix„>a = 0, equivalently, if E 0 for any 

Wn —>■ oo. A basic fact, see e.g. [7] p. 32, is 

Lemma 5.1 Suppose random variables X*,Xn,n = 1,2,... are non-negative, in- 
tegrable and X^ converges to X* in distribution as n —>■ oo. Then {X^} is uniformly 
integrable if and only if E X^ —>■ E X*. 

Proof of Theorem 12.11 In the proof denote G = Gn- 

(E)]. dn converges in distribution to d* by (ED- Thus d^ ^ converges in 
distribution to {d*y^~^ and the proof follows by Lemma ISTl 

n\)^ (fml). Suppose (0 holds. Fix any connected graph H with \ V{H)\ = h. Let 
r be the diameter of H, and let v* be a uniformly random vertex from V (G). Write 
bj{G,v) = \Bj{G,v)\ and b* = \Bj{G*)\. Note that P(&* = oo) = 0 for j = 0,1,... 
since G* is locally finite. For a rooted graph H' denote 

X{H') = emb'{H',G,v*) and X*{H') = emb' [H' ,G* ,r*). 

Using Sidorenko’s theorem. Theorem 12.31 for any H' G 

E A(i7') = emh{H, G) < hom(iJ, G) 

< n^^E hom(A:i,?,_i,G) = Ed^-^ ^ E(d*)'"-i < oo. (12) 

Next, we have X{H') X*{H) in distribution for each H' G B-iH) (apply El) 

to the continuous and bounded function f{G,v) = iernb'{G.H' ,v)=k)- Therefore by 
Fatou’s lemma and ESI, 

EX*(iJ') < liminfEA(iJ') < oo. (13) 

Let e G (0,1). Since E((i*)^’'“^ and EX*{H') are finite, we can find a t > 0 such 
that 

E id*f-Hd^>t < E (d*)'‘-ilb.^^>t < e and 
EA*(Lf')l6*+i>‘ < e each H' G 7^(i^). 

Pick s > t large enough that P(&}_|_]^ > s) < By Lemma [5.II and (H)) for 


each H' G TZ{H) 

EdO-%„<t^E(d*)'‘-%.<*; (14) 

EA(if')l6.+i(G..*)<. ^ EA*(iJ')Ifa*^,<. > EX*{H') - e; (15) 

EA(i/')I^+,(G,.*)6(M] ^ < e; (16) 

P(6,+i(u*) > s) ^ P(6;+i > s) < (17) 
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Define subsets of V{G): 

Ri := {w : dciv) > t}] i ?2 := {v : br+i{G,v) > s}. 

We call an embedding a oi H into G bad if its image shares a vertex with RiU R 2 . 
Denote the set of all bad embeddings by Xbad- Note that for H' G R{H) 

0 < emhiH,G) - niEXiH')lb^^^^G,v^)<s < \Xbad\- (18) 

Let Xi bet the set of all embeddings a whose image intersects both i?i and V \ 
{Ri U i? 2 )- Let X 2 = Xbad \ dfi. By Theorem 12.31 the number of bad embeddings 
which have the image entirely contained in Ri is 

emb(iL,G[i?i]) < hom(iJ, G[i?i]) < ^ 

v^Ri 

< niEd!^^ — niE {d*)^~^ + eni + o(ni) < eni + o(ni). (19) 

Here the last inequality follows by (0) and (ITT)) . Let v gV \ {R 1 UR 2 ) be a vertex in 
the image of an embedding in Xi. By the definition of i?i and R 2 , br+i{G, v) G {t, s]. 
So using dUl) 


\Xi\<ni ^ EX(iL')I^+i(G,t;*)6(qd - 

Now consider a subgraph of G, Ha = H corresponding to an embedding a G X 2 . 
Ha cannot have an edge va Ei = {xy G G : x G Ri,y G H(G)\ (i?iUi ?2 )} , otherwise 
a would be an element of Xi = Xbad \X 2 - So H {Ha) is contained in i?i U Q, where 

Q = U ViBriG-Ei,v))\Ri. 
v£R2 

Note that since each vertex in Q has degree at most t, |(5| < \R 2 \G■ By Theorem l2.3l 

^ dG{v)^-^ + Y.dG{v)’^-\ ( 20 ) 

v^Ri v^Q 

For the second term we have by 013 

dG{v)^~^ < < |i? 2 |rC“^ < eni + o(ni). (21) 

v^Q 

Combining (IT^ . (1^ and (1^ we obtain 1-^21 < 2eni + o(ni). We have proved 

I'Tfiodl = 1‘T’il + |‘T 2 | ^ {h + 2)eni + o(ni). 

Since the proof holds for arbitrarily small e, we see that n^"^\Xbad\ —^ 0. Thus (Iml) 
follows using (13 and m- 


13 


H). (Iml) applied to H = Ki^h-i yields V.{dn)h-i —>■ lE(d*)/t-i, while 
G^G* shows that (dn)h-i —t id*)h-i in distribution. Thus {dn)h-i is uniformly 
integrable by Lemma 15.11 This implies that for any j = — 1 (d„)j < 

{dn)h-i is uniformly integrable, so by Lemma 15 .1 1 again E((i,j)j — ¥,{d*)j. Using 
S(h — l,j) to denote Stirling numbers of the second kind, 

h—1 h—1 

E {dn)^-^ =Y,S{h- 1, j)E {dn)j 


The next fact is simple and known (cf. Lemma 9.3 of |19jL but we include a 
proof for completeness. 

Lemma 5.2 Suppose Gn —t G* and the degree dn of a uniformly random vertex of 
Gn is uniformly integrable. Write ni = ni(n) = |U(G„)|. Let G'^ he obtained from 
Gn by adding or removing edges adjacent to a set Sn Q U(G„) of size o(ni). Then 
G'n^G*. 

Proof of Example 12.21 It is straightforward that the uniform integrability condi¬ 
tion ([u]) fails for G'n , so the other two conditions also fail by Theorem 12.11 The fact 
that Gn and G'n have the same local weak limit G* follows from Lemma [5.21 □ 

Proof of Lemma 15.21 Denote by Nn the set of vertices in G„ \ Sn which have a 
neighbour in Sn- 

Claim 5.3 For any e € (0,1) there are <5 > 0, no > 0 such that if n > uq and 
0 < |5'„| < Sni then |1V„| < eni. 

Proof Let d and no be such that S < e, Ed„l£i^>o. 5 e< 5 -i < 0.5e for all n > hq. 
Assume that Nn > eni for some n > ng. Write d{v) = dcr^iv). We have 

dn^dn^0.3eS~^ ^ ^ ^ F E {d{v) - 0.5e(5 ^)Id(„) >0.56(5“^ 

V^Sn V^Sji 

> |5'„| I |S'ra|~^ d{v) — 0.5e5“^ | > eni — O.Seni > O.Seni, 

V ves„ J 

which is a contradiction. Here we used Jensen’s inequality and the assumption 
> Nn> eni. □ 

Now fix any positive integer r. As is done in CS], we apply the above claim r 

(r) 

times to get that the set Nn of vertices at distance at most r from Sn in Gn has 
size o(ni). Now G'n —>■ G* follows since 

P(B,(G„, V*) ^ Bn{G'n, V*)) > PK ^ iVM) = 1 - o(l). 


□ 
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6 Proofs for Section [4] 


The radius of a connected rooted graph is the maximum distance from any vertex 
of the graph to the root. 

Proof of Lemma 14.11 Let Hi,H 2 ■ ■ ■ be an enumeration of finite graphs in C/*. 
For positive integers i,n define the event 

B{i,n) = {3j < i : ^ H,)\ > i-^}, 

where rj is the radius of Hj. By the assumption of the lemma n)) —>■ 0 for 

each i = 1,2, _Define Ni = 1, Ni, i = 2, 3,... by taking Ni = l+sup{n > iVi_i : 

¥{B{i,n)) > Let i{n) = max{i : Ni < n}. Now let A = {n : B{i{n),n)}. 

We have P(n ^ A) = ¥{B{i{n),n)) < i{n)~^ —>■ 0. For any sequence of events 
{An,n > 1 } on the same probability space (D, P) we have 

P(nr„>i U„>m An) > limsupP(A„). ( 22 ) 

Thus P(|A| = oo) > limsupP(i3(j(n), n)) = 1. Now by the definition of B{i,n) we 
have Gn —t G* on the event |A| = oo. □ 

Proof of Lemma 14.21 (<^=) The function {G,dioc) —t R that maps (G,u) to 
^B.r{G,v)siH is bounded and continuous for each r > 0 and connected rooted graph H. 

(=>) Without loss of generality we may assume {G„} are dehned on a single 
probability space. Let A be the event guaranteed by Lemma [4.11 Suppose there 
is some e > 0, a bounded continuous function / and an infinite subset of positive 
integers B, such that P(|/(G„,v) — E/(G*)| > e) > e for all n € B. Define a 
random set C = {n G B : |/(G„, u) — E /(G*)| > e}. By (l22ll . P(|A n G| = oo) > e. 
However, G„ —>• G* when n ^ Qo,n G A. This is a contradiction. □ 

Proof of Lemma 14.31 We can assume ni > 1. 

(Op (03i. This follows by Lemma 15.11 since Lemma |4.2I implies convergence 
in distribution of to (d*)^“^. 

(Op ([3(, Imp . Assume (P. Then there is a positive sequence a„ —> 0, such 
that |E(i()“^ — E {d*)^~^\ < an- The empirical {h — l)-st moment of the degree of 
Gn is Dn = nC^hom(iFi,ft_i,G„). Also ^Himot{H))’^-^pr{Gn, H), 

where S consists of graphs in C/* of radius 1. Since ED„ = Ed()“^ it follows by Q 
and dp that Dn A- E So there is a positive sequence £„ —t 0, such that for 

all n 

¥{\Dn-Eid*)'^-^\>en)<en. 

We may assume that > a„. Let the random set G consist of those n for which 
\Dn — E(d*)^“^| > tn- It follows by (|2^ that P(|G| = oo) = 1, P(n S G) —>■ 1 and 
on the event \C\ = oo, —>• E {d*)^~^ as n —>■ oo, n G G. 

Let A be the random set guaranteed by Lemma l4T] On the event |A(~lG| = oo, 
the subsequence of graphs {Gn,n G AdG} satishes the conditions of Theorem l2.ll 
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Assume (ED does not hold. Then there is H' £ TZ{H), e > 0 and a deterministic 
infinite set D of positive integers such that for a\\ n G D 

P(|X„-EX*| > e) > e. 

Here Xn = emh{H,Gn) and X* = emb^(iJ', G*, r*). Let Di D consist of 
those n for which |X„ — EX*| > e. Again, by (I22L we get that P(|A fl G fl I?i| = 
oo) > e. On this event Xn 7 A EX* as n —>■ 00 ,n G An C. This is a contradiction 
to Theorem 12.II diul) . 

It remains to show (Iml) . Write hom(iL, G„). Trivially, X„ < Yn, and 

by Lemma 1^751 Y„ < Dn- So for any t > 0 

^Xnlx„>t < EP"nIv„>t < EA'nlD„>i- 

Since EII„ —?> E((i*)^“^ and A E((i*)^“^, Dn is uniformly integrable, and so 
is Xn- Since X„ also converges in probability by (O, dm]) follows by Lemma ETJ 

The proof is identical to that of the corresponding implication of 
Theorem 12.11 □ 

Proof of Corollary 14.41 Apply Lemma [4.31 □ 

Proof of Corollary 14.51 For non-empty G we haye 

^ hom(P 4 ,G) _ hom(A:i, 2 ,G) , _ hom(A:i, 3 ,G) 

^ emb(A: 2 ,G)’ ^ ’ emb(X 2 ,G) ’ ^ ’ emb(X 2 ,G) ' 

Let S{t,j) denote Stirling numbers of the second kind. By Lemma 14.31 
n-Amb(A: 2 ,G„) AEd*; 

n~^ hom(P 4 , G„) = n“^(emb(P 4 , G„) -b emb(Ar 3 , G„) -b emb(Ar 2 , G„)) 

AE (emb'(P.^,G*)-bemb'(A:',G*)-bemb'(A:',G*)) = E hom'(P^, G*,r*); 

t 

n-^ hom{Ki^t,Gn) = ^ S{tJ) emh{Kij, Gn) A E G*,r*) = E (d*)* 

for t = 2, 3. The claim follows by applying Lemma HAl □ 

Proof of Corollary 14. 6 I Note that 7 rfc(G„) A P(d* = k) > 0 and when TT^iG) > 0 
we haye 

_ E {dGiv2)^dGniA)=k^viv-&G\Gn = G) 

P(dG„K) = fc,^^A 2 *eG|G„ = G) 

= = n-^H{G){kxu{G))-\ (23) 

k{ni - 1 ) %fe(G) 

where H{G) is the number of homomorphisms from P 3 = xyz to G so that x is 
mapped to a yertex of degree k. Denote by Ht{G) the number of such homomor¬ 
phisms where additionally y is mapped to a yertex of degree at most t, and let 

H.iG) = mG)-H,(G) 
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( 24 ) 


Fix S > 0. We will show that for any e > 0 and all n large enough 

-EId.=fchom'(P;(,G*,r*)| > r5) < e, 

i.e. AEId.=fehom'(P;^,G*,r*). 

By Lemma 1321 


^Ht{Gn) — E 


E 

u\uv'^ GGt 


^G-n 


Ah: 


E 



do- (u)IdQ. 



Also hf ^ h* = Elrf.^fc hom^(P 3 , G*, r*) as t —>■ oo since Gn^G* and h* < 
E hom'(P 3 , G*, r*) < oo by Lemma [4.31 Therefore we can pick ti such that for 
t > ti and all n large enough 

p(inr'i?t(G„)-h:i>|)< A |h:-h*i<^ 

and so 

P(|nr'iJt(G„)-h*|>^)<|. (25) 

Next, note that Ht{Gn) < J2vev(Gn) '^Gr.(^)^AG„(«)>*■ So by Markov’s inequality 

P(nr^ihi(G„) > S/2) < 2S-\/^EHtiGn) < 25-^Edlld„>t. 

is uniformly integrable by Lemma 14.31 so there is t 2 such that for all t > t 2 and 
all large enough n 

P(nr^idt(G„) ><5/2) < |. (26) 

Now (1211) follows by setting t = max(ti,t 2 ) and combining (1221) . (1251) and (125|) . □ 

Proof of Lemma 14.71 Let Zi,Z 2 ,--- ^ ~ 1 Di be independent. For a 

random variable X with EX € (0, oo) and its size-biased version X* we have 


E(X*-1), = ^(m-1). 


mP(X = m) E (X)j+i 


i>i 


EX 


EX 


/ = 1,2, 


Thus 


E{Zi)j =E{D 2 )j+i{ED 2 )-^ <oofor/ = l,...,fc. (27) 

Also EZ^ = {E D 2 )~^E < oo (use Remark 13.21 in the case (i)). Write 

(fci ^ fe ) “ kii^'k-r Conditioning on Di, using linearity of expectation and sym¬ 


metry, we get 


E(d*)'= 



•• xEZ’^^ e (0,oo). 


(28) 
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Here the sum is over all j and all tuples of positive integers (fci ,... ,kj) such that 
ki + ■ ■ ■ + kj = k. Write dn = dc^iv*) and recall that v* is a uniformly random 
vertex from V{Gn)- By Theorem 13.1 1 A-G*, so d^ —>■ (d*)^ in distribution. By 
Fatou’s lemma 

E(d*)'= < liminfE^- 

We assume without loss of generality that in the case (iv) the sequences di and 
^2 are symmetric (each permutation of a particular sequence is equally likely). So 
in all cases (i)-(iv) by symmetry Kdc^ivi)^ = Ed(!j, where vi is a fixed vertex in 
V{Gn)- For each of the random intersection graph models we will show 

EdaA^if <E{d*)’= + o{l). (29) 

Let T ~ T{Di, 02 ). Assume that Di = d 7 -(root(T)), Xi,... ,xd^ are the children 
of root(T) and Zi is the number of children of Xi. For each n define a bipartite 
graph (a tree) as follows. On the event Di > n 2 , let be a tree consisting 

of just the root vi. On the event Di < 712 , let iL„ be the subtree induced by 
generations 0,1 and 2 of T, but take only the first Z[ = Zilz^Km-i children for the 
node Xi, i = 1,... ,Di. Label the root vi. Given Di, Zi,..., Zj^i draw labels for 
x\,... ,xdi from V'^ uniformly at random without replacement and draw Z[ distinct 
labels from \ {ui} for the children of Xj * = 1,..., Di, for each i (conditionally) 
independently. Here G* = Vi{Hn) is the set of first rn vertices of the fixed ground 
set V* as in Theorem 13.II 

Write dn = I_Di<n 2 A notice that dn is an upper bound on the degree 
of vi in the resulting intersection graph. We have as in (|2^ 

^^ Je(^^')i 77,<„,E(A)'=^ x...xE(A)'=* 

= E(d*)'=-o(l). (30) 

Here we used (1271), (EHl) and bounds 

E{Di)jlDi<n 2 =E(L>i)j - 0 ( 1 ); E{Z[y = EZ{ - EZ{lz^-^m-i = EZ{ - o(l), 

valid for any 3 < k hy Lemma ISTl Thus it suffices to prove that Eda^ivi)^ < 
Ed^ + o{l). Recall that is the bipartite graph underlying the intersection graph 
Gn- Call a path xyz good ii x = vi, y € V 2 and z £ Vi \ {ui}. We have 

do^ivi) <'^Kviwv,Hn) and dn ='^Hviwv, Hn) 

where the sum is over all good paths viwv and 1{F, H) is the indicator of the event 
that F C E{H). 

For any graph F[ we denote v{F[) = \V{F[)\ and e(iL) = \E{H)\. If H is 
bipartite Vj{El), j = 1,2 denotes the size of j-th part V) of E[. Define an equivalence 
relation between bipartite graphs H' = {V{, V^, E'), H" = (V{, V^, E'): H' - H" if 
and only if there is an isomorphism from EE to H” that maps to V”, j = 1,2. 
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Let iFk consist of one member for each equivalence class of all graphs formed from 
a union of k good paths (a not necessarily disjoint union of graphs Gi = {Vi , Ei ) 
and G 2 = (V 2 , i? 2 ) is a graph {Vi U V 2 , U i? 2 ))- For H' G let N{H') be the 
number of distinct tuples of k good paths whose union is a bipartite graph H" with 
parts V(' C and V 2 C such that H" ^ H'. It is easy to see that there are 
positive constants c{H'),C{H'), such that for all n large enough 

N{H') = + o(l)). (31) 

By linearity of expectation 

I{viWV,Hn))‘^ = ^ N{H')EI{H',Hn)- 


and similarly 

Using (l30)) . (I 3 T]) and the fact that Jfe is finite, in order to prove (l2^ it suffices to 
check that 

+o(np^^'^+^) for each iJ'G 7"^. (32) 


So fix any H' G Tk- Suppose V 2 {H') = t and the degrees of vertices in V 2 {H') 
are bi,... ,bt. Note that bj < k + 1 for j = 1,... ,t. Conditioning on Di and the 
positions of generation 1 nodes labelled V 2 {H') and using (l?7l) 


ft \ _ T 17 (^1)61-1 ,, {Z't)bt -1 

{n2)t (ni-l),,_i ■■■ (ni-l)f,,_i 

t 

= E (Z'),,_i(l + o(l)) 

t 

= D 2 )-* n IE {D 2 )b, (1 + o(l)). 



(33) 


Now if H' is a tree then e{H') = v{H') — 1 and (15^ follows if 

EI(if',iL„) <EI(iJ',iL„)(l + o(l)). (34) 

Meanwhile, if H' has a cycle then e{H') > v{H') and EI(iL', Hn) = ^), so 

(EH) follows whenever 

EI(iL',iL„) = o(n-"(^')+^). (35) 

We now consider (15^ for each model separately. 

(i) (active intersection graph) Let ai,..., Ug be the degrees of vertices in Ui(iL'). 
We can assume oi = du' (ui) = t. Of course, aj < k, j = 1,..., s. Since the vertices 
in Vi{H') choose their neighbours independently. 


EI(iJ',iJ„) =E[]; 

2=1 


{n2)ai 


= nr^^'^l[E{Di)a,{l + oil)). 
2=1 
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If H' has a cycle then e{H') > v{H') and Hn) = O ^ j, so (1551) holds. 

By Remark [321 D 2 ~ Poip-^EDi). So E{D 2 )b^ = (p-^EDi)^'. Thus ([Ml) 
reduces to 

= (/?ni)-"(^')Epi)t(Ei^i)'=(^')-‘(l + o(l)) 

If H' has no cycle, then Oj = I for all j > 2. Thus 

EI(iJ', i?„) < pi)t(E + o(l)) 

and (1351) follows. 

(ii) (passive intersection graph) Since hi < k + 1 for i = 1,..., t by the assump¬ 
tion (ii) of the lemma 

EI(iJ', if„) = E n n E p 2 )b.(l + o(l)). 

Using Remark 13.21 Di ~ Po(/ 3 ET> 2 ), so E(£>i)t = /I*(ED 2 )*. Therefore (l33l) 
reduces to 

S 

EI{H', Hn) = H E { 02 ) 1 ,, (1 + o(l)) 

i=l 

and (1321) follows. 

(Hi) (inhomogeneous random intersection graph) Let : u € V{H')} be 
independent random variables such that ~ for u € Vi(H'), i = 1,2. Write 
a Ab = min(a, b). Then 

EI(ii-',iJ„) =E TT 

^ M (1 + o(l)). 

uev{H') 

If H' contains a cycle, then by the assumption that and are 

finite, we get that EI(iL',iL„) = O so (1551) holds. If H' is a tree then 

t 

El{H', Hn) < Y[ E ^ 35 ) 

i=i 

Using R,emarkl521 we have £>1 ~ and D 2 ~ Fo(/3-^/2^(2)EC(i)), 

so E{Di)t = /3*/2E(e(i))*(Ee(2))t E{D 2 )j = /J-^'/^E^j. j < 
fc -k 1. Putting these estimates into (155)) and simplifying we get the expression on 
the right of (1551) . (1551) follows. 

(iv) (random configuration graph) For i = 1,2, let 

rii 

^i,m — ri^ ^ 

1=1 
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By the assumption (iv) of the lemma there is a function 0 such that for 

i = 1,2 and m = 1,..., 2fc + * — 1 

P(|4^-E(A)m| > /;) =o(nr'"). (37) 

Recall that N = is the total number of half edges in each of the parts. 

Since ni,n 2 —>■ oo and P(|iV — EDini\ > /nU-i) —>■ 0, we can assume without loss 
of generality that there exists —>■ oo, such that for all n 

ni,n2,N>LOn] u!n>k + 2. (38) 

The probability that contains H' as a subgraph is at most 


\ ^ > uev(H') j 

Here the product counts the number of ways to choose particular half-edges forming 
H' . Let (uj, ... ,v*) and ... ,u*) be independent uniformly random tuples of 
distinct vertices from Vi(i7„) and V 2 {Hn) respectively. Using symmetry, (1571) . (1551) 
and the union bound 


a{H')=E I 1 A l[^dHAv*))b, 


{N)e(H') 




(jv),(g,) n n (1+0(1)) 


= (E n n ® (^2)b, (1 + o(l)) + 

i=l j=l 


Again, if H' has a cycle then (1551) follows since 2k > v{H') — 1. Otherwise, if H' is 
a tree, then since EUini = EZ? 2 'ra 2 (l + o(l)) we have EDi = I3ED2 and 


(E77i)-^(^')nE(77i)„, =E(77i)*^^^i]^ =r‘E(i^i)t(IEi?2)-‘. 


i=l 


By comparing a{H') with (1551) we see that (1541) holds. 


□ 
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A Proof of Theorem 13.1 


We need different proofs for each of the graph models. The common part in these 
proofs is that the local weak limit for a sequence of random intersection graphs {G„} 
that we study is determined by the limit for the sequence of underlying bipartite 
graphs {Hn}. 

For rooted graphs G,H G Qt, we denote Pr{G,H) = ]lB^(G,root(G))s_y- If G is 
a random rooted or unrooted graph, we will write Pr{G,H) = ¥,pr{G, H). Thus 
Pr{G,H) is the probability that an r-ball rooted at a random vertex is isomorphic 
to H; if G is unrooted, then the ball is centered at a uniformly random vertex, 
while if G is rooted, then the ball is centered at root(G). 

Let {Gn},Di,D 2 be as in Theorem 13.11 Let be the bipartite graph corre¬ 
sponding to Gn- Since Br{Gn,v) is determined by B 2 r{Hn,v) for v € F(G„) and 
the subset of graphs in C/* that have radius at most 2r is countable, to prove the 
theorem it suffices to show in each of the cases that for each positive integer r and 
each connected rooted graph H 

4p,(r,iL). (39) 

For the proofs we need to generalize the notation pr,Pr- Let G be a random graph 
with at least k vertices. Let Hi,... ,Hk be rooted connected graphs. We write 

Pr(G, Hi, . . . , Hk) = E |G), (40) 

PriG,Hi,...,Hk) = EpriG,Hi,...,Hk). 

where (u*,...,u^.) is a uniformly random tuple of k distinct vertices from V{G) 
(given F(G)). 

When G = {Vi,V 2 ,E) is bipartite with |Fi|, IV 2 I > k, for i G {1,2} define pr^ 
and similarly as Pr,Pr, but taking a uniformly random tuple {vl,...,vl) of 
distinct vertices only from the part Vi. 

A.l The active and passive models 

For i G {1, 2} define z = 2 if i = 1 and i = 1 if i = 2. Let (F, S) be a pair where F is 
a forest of labelled trees, and S C V{F) a subset of its leaves (we call a node a leaf 
if it has degree at most 1). Assume further that F can be represented as a bipartite 
graph with parts Vi{F) and V 2 {F) with Vi{F) C V®. We denote the collection of 
all such pairs {F, S) by A. 

For a random graph Hn of Theorem 13.11 and {F, S) G A, we will denote by 
A{Hn,F,S) the event that F is a subgraph of and for any v G V{F) \ S we 
have Th^ (v) = Fi7’(u). We will call vertices in S active and the vertices in V(F) \ S 
closed. 

For a discrete random variable X, denote by D{X) = {x : P(A = x) > 0}. 
Recall that X^ = dH„{z) is the degree of z in iJ„. 
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Lemma A.l Let {Gn}, I?i be as in Theorem \3.lY i]. Then Hn satisfies w 
with D 2 ~ Po^P ^Di). 

Furthermore, let i G {1,2} and let k be a non-negative integer. Let {F, S) € A 
and let z € S. Call {F, S) feasible if for all u € (V(F) fl V^) \ S we have dp^u) € 
D{Di). Let A = A{Hn,F,S). 

Assume {F, S) is feasible and z € S is of type i (z €. V*}. Then for all n large 
enough P(A) > 0, 


and 


limP(Az = k\A) 


^{Di=k), ifdF{z) = l, 
F{Di = k), ifdF{z) = 0, 


limP(3'u € V{F) : zu G Hn, zu ^ F\A) = 0. 


(41) 


(42) 


Proof Fix a feasible {F, S). Let us first show dH). We use the notation of Section^ 
By symmetry has the same distribution as the degree in Hn of any fixed vertex 
w of V"^. By independence of Sy, v G V^, has distribution Bin{ni,p) where 
p = p(n) = P(ii; G Sy^) = and vi is a fixed vertex of V^. So E ^ 

and 5n = '' — —>■ 0 as n —>■ 00 . Let D 2 and have Poisson distribution 

with parameters and nip respectively. By Le Cam’s theorem and by the 
properties of Poisson random variables (see, e.g., m). we have 

dTv{X^^\D2) < dTv{X^^\Yn)+dTv{Yn,D2) (43) 

< 2nip^ +F{Po{\6n\) >0)^0. 

Therefore (|4]) holds with D 2 ^ Po{P~^EDi). 

Let L be the set of leaves in F. We will show the rest of the claim by induction 
on |C(F)| + |L \ S'!- First suppose F is an empty graph and S' = 0. Then trivially 
F{A{Hn, F, S)) > 0 and (HTl) . hold. Now suppose I > 1 and we have proved 
the claim for all feasible {F', S') G A such that |P(F')| + |L\ S| < /. Let {F, S) G A 
be feasible and such that |C(F)| + |L \ S| = /. 

We begin by proving that 

¥{A{Hn, F,S)) >0 for all n large enough. (44) 

Indeed, if F has a closed leaf v, (HI)) follows by induction and (HD) . (H^ applied 
to {F,S U (uj). Else, if F has an active vertex u G S of degree zero, then it is 
trivial. Otherwise all leaves are active. Let v be the neighbour of one of the leaves. 
The set Ly of the children of v (vertices in F f{v) which are not on the path from 
V to the root) satisfies Ly C S. By induction A{Hn,F — Ly, (S \ Ly) U (uj) holds 
with a positive probability. Also using induction, (HD) . (H^ imply that v has \Ly\ 
neighbours in G — V{F — Ly) with a positive probability. Symmetry implies (1441) . 

In the rest of the proof we assume n is large enough that (|4^ holds and show 
65) and (1421) for {F, S) and an arbitrary z G S. Recall that F is a bipartite graph 
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with parts {Vi{F),V2{F)), Vi{F) C V\ i = 1,2. We assume that n is large enough 
that Vi{F) = |T^i(F)| < m, i = 1, 2. For any u € and wi,W 2 G we have 


¥{uWi,UW2 S Fin) < 




n2 


= o(n2^). (45) 


Here the last bound follows since is uniformly integrable, see e.g., [5] or 
Lemma [Hi- 

First consider the case z £ V^. Recall that Sz = ri/^(z). We may write 
= dpiz) + X' + X'J where Xi = \Sz\ R(F)| and X" = |F(i^) \ TFiz)\. 

Let {rci,..., Wr} = V 2 {F) \ S he the set of closed type 2 vertices in F. For 
u £ \ V{F) denote by Bu = 5«(n) the event that uwi,... ,uwr ^ Hn- Write 

A = A{Hn, F, S). By independence of v £ V^} 


an := P(uz € Hn\A) = P{uz\Bu) (46) 

and note that a„ does not depend on m G \ V{F). Furthermore, we can easily 
verify that for any distinct ui,... ,ut £ \ V{F) 


P(uiZ, ...,UtZ£ Hn\A) = ]^P(z € Sui\Bui) = an, 

i=i 

i.e., the events uz £ Fin for u £ \ V{F) are conditionally independent given A. 

Therefore conditionally on A, the random variable X' is distributed as where 
X'^ ^ Bin{ni - Vi{F),an)- 

Let us estimate an- For any u £ \ V{F) we have using (H51) and the union 

bound 

rEX(^) 

P{Bu) > 1 - rP(uwi G Hn) >1-= 1 - o(l); 


{uz £ Hn,Bu) < V{uz £ Hn) = 


«2 
EX(i) 
n2 


EX(^) 

{uz £ Hn, Bu) > P{uz £ Hn) - rP{uZ, UWi £ Hn) = - (1 - o(l)). (47) 

n2 


= ( 1 - 0 ( 1 )) = ^( 1 - 0 ( 1 )). 
n2 n2 


So by ([3|) and (|46|) 


Suppose Zn ~ Bin{ni,an). Similarly as in 

dTv{X'j^, D 2 ) < dTv{X'j^, Zn) + dTv{Zn, D 2 ) < Vl{F)an + 0 ( 1 ) —>■ 0. (48) 

Now consider X'J. Suppose m G (^ fl V^) \ Ff( 2)- If dF{u) = 1, we may assume 
without loss of generality that the neighbour u' of u in F. Write W = {rci,..., Wr}\ 
{u'}. Using independence and (l45|) and (lT7)) 

IPf'z/2^ 'ii'u! ^ H ) 

P{uz £ Hn\A) = V{uz £ Hn\uu' £ Hn,C^weW’UW ^ Hn) < ^- . " s = o(l). 

r{r\u,ew'Uw ^ Hn) 
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Similarly, if dp^u) = 0, 


P{uz G Hn\A) < 


F{uz G Hn) 
P(ltri;i, . . . , UWr ^ Hn) 


n2 


( 1 + 0 ( 1 )) 


0 ( 1 ) 


Thus by the union bound P{X'J > 0|+) = o(l): this yields (1421) when z is of type 
2. Using ((48l) we conclude for fc > 1 and z € S DV^ 


P(^Z 


k\A) 


V{Po{PDi/(3) = k), if dp{z) = 0; 
P{Po(P Di/(3) + 1 = fc), if dp{z) = 1. 


Since Po(A)* ~ 1 + Po(A) (see Lemma 1X31) . (HD) follows when z G S' n V^. 


Now suppose z G V^. We will prove the claim only in the case dp{z) = 1. 
The case dp{z) = 0 is similar, but simpler. Let z' be the only neighbour of z in 
F, and let {wi,..., Wr} = 1^(+) \ (S U {z'}). Write B = B{n) for the event that 
zwi ,..., zwr ^ Hn ■ We have 

P{X, = k\A) = P{X, = k\zz' G H„, B) 


and using symmetry 


P(W, = fc, zz' G Hn, B) = P{zz’ G Hn\B, X, = fc)P(X, = fc, B) 

= —^(P(i^l =fc)+o(l)). 
n 2 — r 


The estimate P(X^ = k,B) = V{Di = fc) + o(l) follows since by the union bound 
and (|45l) . 0 < P{Xz = k) — P{Xz = k,B) < rV{z',wi G Sz) = o(n^^) and P{Xz = 
fc) —>• P(iAi = fc) by dH). Therefore using also (1^ 


P(X, = fc|Al) 


r{Xz = k,zz' eHn,B) 
P(zz'€Hn,B) 


kP{Di = fc) 

ElDi 


F{Dl = fc). 


Finally, we show (SH), also only the case dp{z) = 1. This is trivial if s = |SnV^| = 0. 
If s > 1, fixing any u>o G \ {z'} and using (H51) and (ITTl) 


P(|S.nU(F)| > 2|Al) < 


iuG(Sny2)\{2'} 


{wz,zz' e Hn,B) 
P(zz' eHn,B) 


The next lemma follows using a lengthy but essentially trivial argument. 


□ 


Lemma A.2 Let {Hn} be as in Lemma \A.l[ Fix i G {1, 2}, a non-negative integer 
r and let Ci, ..., Ck be connected rooted graphs of radius at most r. Then 

k 

pW(i/„, (49) 

i=i 

where T = F{Di,Dj) and Di,D 2 are given in (01, m- 
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Proof The lemma is trivial for r = 0, so assume r > 1. First assume Ci,..., Cfc 
are such that n,tiP.(r, Cj) > 0. Then Ci,...,Ck are rooted trees; we can 
also assume they are vertex disjoint. We couple a BFS search process with the 
construction of the first r generations of k independent copies of T in the natural 
way as follows. Let F' = C[\J- ■ -UC^ be an arbitrary embedding of F" = CiU- • -UCfc 
into plane. Let vi,V 2 , ■ ■ ■ ,vthe the vertices of F' in the order they are visited by the 
breadth-first search that terminates at level r — 1 and let dj = dp'ivj). (The BFS 
starts with the root vi of C'^ and visits each vertex in Br-i(vi). After exploring 
C' with j < fc it jumps to the root of The degrees of each tree are listed in 

the top-to-bottom, left-to-right order, where the root of a tree is at the top, and 
the children of each node are ordered from left to right). 

To this exploration of a fixed forest associate a “truncated” BFS exploration of 
the graph iF„. 

At step 1 we choose as a root of Gi a uniformly random vertex u* from and 
reveal its neighbours Sv* in Fin- Define V* = {u*} and = u*. We say that 

step 1 succeeds in iF„ if X^, = di- 

Now let j £ {2,3,, t}, and assume the steps 1,... ,j — 1 succeeded. At step 
j we do the following. If Vj is not a root node in F' then let v be the parent of Vj in 
F'. Let u = (jiiv). Choose x uniformly at random from Su \ Reveal d' := X^, 

the degree of x in and the remaining X^ — 1 elements of the set S{x) = F//„ (x) 
and say that step i succeeds if d' = dj and no vertex in S(x) \ {u} was revealed 
before step j. Finally set V* = V*_^ U {x}, 

If Vj is a root node, then we let x be a uniformly random vertex from R* \ Rj-i, 
where Rj-i = {it £ V{Hn) : P < j — I, 4’{'^p) = u,Vp £ R} and where R is the set 
of the root nodes of F'. We reveal S{x), set (p(vj) = x and define V* = V*_i U {x}. 
We say that step j succeeds if d'j := X^ is equal to dj and no vertex in S{x) U {x} 
was revealed in previous steps. 

Finally, in the third process we simply generate the first r generations of k 
independent copies of the branching process F{Di, Dj). We query the random 
number of children for each node in the same order as the BFS search process on 
F. If the degree d" of a particle queried at step j is equal to dj, we say that the 
step j in the branching process succeeds. 

Let Sj be the event that step j succeeds in id„. Let Uj_i be the set of vertices 
revealed until step j, i.e., Uj_^ = V*_i U {m : 3ii G ■ uv £ Fin}. 

Let F'_^ be the subgraph of F' induced on vertices {vi,... ,Vj-i} and their 
neighbours in F'. Let Fj-i be an arbitrary fixed bipartite graph with parts 
Vi{Fj-i) C V® and V 2 {Fj-i) C V® such that Fj-i ^ Fj-i- Recall that two bipar¬ 
tite graphs satisfy Hi ^ H 2 if and only if there is a part-preserving isomorphism 
between Hi and H 2 ', fix such an isomorphism aj from Fj-i to F'_^. Also define 
Tj-i ■= V{F'_^)\{a(v) : v £ {vi,... ,Vj-i}} 

First suppose Vj is a root node of F'. Since v{F') = |R(F')| is constant, using 
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an inequality |P(i3)—P(i?|C')| < P(C') valid for arbitrary events B, C with P{C) > 0 

\F{s,\Si, Vi) - nSj\Hvj) ^ u;_„Su..., Vi)| 

< P(<^(z;,) G u;_,\s,, 5,_i) < ^ = o(l). 

TLi 

Fix z G V® \ V{Fj-i), and let be the graph consisting of Fj-i and an isolated 
vertex z. Let Cj be the event that dn^iz) = dj and there is no edge from z to 
ViF,_i) in iL„. 

Let Aj = A{Hn, F^_^,Tj-iU{z}). Conditioning on the vertices (j){vi ),..., 
using symmetry, independence of v* and Fin and Lemma rA.il we get 

P(5,|</>(n,) ^ , 5,_i) = nC,\A^) ^ P(A = d,). 

Thus aj = P(S'j|5'i,..., Sj-i) —?► F{Di = dj). 

Now suppose is a non-root node of F"'. In this case let Aj = A(iF„, Fj_i, Tj_i) 
and let z G V{Fj-i) be such that a{z) = Vj, and suppose z G V^. Then using sym¬ 
metry, Lemma lA.ll similarly as above 

a, = P(A, 1^1,..., Vi) = nCAA,) ^ ¥{d; = d,). 

Since the number of vertices in F is constant, we get Pr^"!,..., S't) —!> Hj 
which is exactly the probability for all steps in the branching process to succeed. 
Since this holds for each embedding F' of F, (|4^ follows. 

Finally suppose Y\^j^iVr{T,Cj) = 0 (i.e., some Cj has a cycle or a vertex of 
degree not consistent with D{Di)). Let 7^ be the family of all unlabelled rooted 
trees T of radius at most r such that Pr{F,T) > 0. Clearly % is countable, so 
applying (|4^ to partial hnite sums we get 

p«(i7„,Ci,...,C'fe) = l- ^ p«(i7„,Cl,...,C'^) 

C[,...,C',&Tr 

k 

^1- E = 


Proof of Theorem l3.1l (iL (iil We first show fib LetT = T{Di, 02 ). Lemma lA.2l 
implies that for any set of rooted connected graphs Ci ,..., Cfc 

k 

Fpr{Gn, Cl, . . . , Cfc) = P,(G„, Cl, ... , Cfe) ^ n (50) 

i=i 
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For V let be the indicator of the event Br{Gn, v) = C. Denote p = p{n) = 





Since Epr{Gn,Ci) —>■ PriGp^Gi), we have pr{Gn,Ci) A PriG'r,Gi) by Cheby- 
shev’s inequality. 

The proof of (ii) follows analogously, but using i = 2 in Lemma [4.21 Indeed, if 
Hn = (D^, F) and G{Hn) is a passive random intersection graph with = |D*|, 
then G{H^) with H'„ = {V'^,V^, F) is an active random intersection graph with 
parts of size n'l = n 2 and = ni. In particular, in this case P' = lim ^ = I3~^ 
and by Lemma [Ql (1^ holds with Di ^ Po{D 2 /P') ~ Po{l3D2). □ 

A.2 The inhomogeneous model 

We will reduce Theorem Id.ll iiil to the general model of Bollobas, Janson and 
Riordan [niiis]- Results of [H] have also been applied by Bloznelis m to study 
the largest connected component in an inhomogeneous random intersection graph. 

Let § = {S,p,) be a probability space and let k = kk^ '■ [0,oo) be a 

measurable function, or a kernel. The random inhomogeneous graph G(n, k) on 
the vertex set {ui,..., Vn} is obtained by sampling independently at random points 
xi,X 2 , ■ ■ ■ ,Xn from the distribution p. Then for each orderccj^ pair {vi,Vj) we 
add an edge ViVj to G(n, n) with probability pij = min( ^^^^’^^^ , I), independently, 
merging any repetitive edges. For i G {1,..., n} we call Xy. = Xi the typ^ of Vi. 

The associated Galton-Watson branching process is defined as follows [19]. 
There is a single particle in generation 0. The type of this particle is chosen from 
S according to the distribution p. The children of each particle P of type x have 
types which are the points of a Poisson process with intensity 2K(x,y)dp{y). The 
children for each particle in the same generation are generated independently, and 
all the children of generation i particles make up generation i + 1. We use the same 
notation X^ to denote the corresponding random possibly infinite rooted tree. 

Although the work [18] probably contains what is necessary for our proof, it is 
simpler in our case to use the results of m- These results also hold in a more general 
framework where kernels for arbitrary small subgraphs (for example, cliques) are 
allowed. However, a single kernel function (corresponding to the subgraph K 2 ) is 

®For our simple application it would be more natural to add each unordered pair with 
probability 2pij. The results are equivalent, but for the sake of consistency we stick to 
the definition of |19 |. 

^In this section we use the notion type to refer to an element from S, not to the part 
of a vertex in a bipartite graph. 
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sufficient to model the underlying bipartite graph Hn of the inhomogeneous random 
intersection graph. 

Lemma A.3 (Theorem 9.1 of U9f } Suppose k is integrable on S"^. Then for any 
non-negative integer r and any rooted eonneeted H 

Pr{G{n, k), H) a H). 

The proof of [19] is elegant and based on approximation by a bounded kernel and 
embedding the inhomogeneous branching process into a homogeneous process. To 
apply G{n, n) to random bipartite graphs, we need a minor technical modification of 
the above lemma (we omit the proof). For a graph G, a set A C F(G) and a rooted 
connected graph H, let Pr{G, H, A) denote the probability that Br{G,v*) = H, 
where v* €„ A (define Pr{G, H, A) := 0 if A is empty). 

Lemma A.4 Suppose k is integrable on S^. Then for any non-negative integer r, 
any rooted connected H and any measurable set A Q S such that p,{A) >0, if A 
denotes the set of vertiees v of G(n, n) sueh that Xy € A then 

PriG{n,K),H,A) ^PriX,,A,H). (51) 

Here denotes conditioned on the event that the type of the root is in A. 

We need a simple fact about size-biased mixed Poisson distributions. 

Lemma A. 5 Let A he a non-negative random variable with 0 < E A < oo. Suppose 
X ^ Po(A). Then the corresponding size-biased random variable satisfies 

X* ~ Po(A*) -h 1 

Proof The characteristic function of X is 

(fxit) = = (()a(-*( 1 - e*‘)). 

Also, EA = EA. If a random variable Z with 0 < EZ < oo has a characteristic 
function (fz, then the characteristic function of Z* is {iK Z)~^(j)'^(t), see, e.g., [3]- 
We see that the characteristic function of A*, 

</.x.(t) = (iEA)-AVA(-*(l - e*‘)) 

is equal to the characteristic function of A ~ Po{A*) -\- 1; 

cfvit) = PVa*(-*( 1 - e*‘)) = (iEA)-ipVk(-*(l - e**)). 


□ 

Let (3 > 0, be as in Theorem IS.lI iiib We assume are 

independent. For i = 1, 2 define = (1 + 
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Consider a random iirhomogeneous graph G{N^ k) where S = (5, /i) and k = 
Kp : ^ [0,1] are as follows. Let S = {1, 2} x R, and let /r be a measure induced 

by the random vector 

X = itx,wx) = (1+I,lfi + (1-I)l2) (52) 

where I is a Bernoulli random variable with parameter (1 + (3)~^ independent of 
{Cr,C 2 }- Finally, set for x,y £ S 


K{x,y) 


2^x'^y-i if G ^ 

0, otherwise . 


(53) 


We consider the graph G{N, k) as a bipartite graph {V^,V^, E), with consisting 
of all vertices Vj such that ix^ = i- 


Proposition A.6 Let be the branching process corresponding to G(N, k) de¬ 
fined above. Let X = {ix,wx) be the random type of the root of Let T ~ 
T{Di,Dj), where for i = 1,2 

A Po(c,e^*)iEe®), Cl = 

Then for any i £ {1,2}, non-negative integer r and a rooted tree T of radius at 
most r 

P(B,(A„) ^ T\tx =i) = Pr{T, T). 


Proof Recall that X has unconditional distribution y defined in (l52l) . Let X 2 be 
an independent copy of X. 

We have P(*x = i) = Qi, where = (1 + /3)“^ and q 2 = /3(1 + /3)“^- Write 
s = i. By definition, conditioned on A = x = {i,w), the types of children of the 
root are points of a Poisson process with intensity 


yi,w{z) = 2K{x,z)dy,{z) = wwzli^i,dy,{z). 


Thus, given X = x = {i,w), the number dop of children of the root is distributed 
as 


Po(icE ) = Poiwq^E ). (54) 


Given X = {i,w), do.i = k, the types of the children of the root are independent 
elements Ai^,..., Ai^^ of S', where Ai j = (s, Wi j) and Wi j has distribution pLg 
given by 


Ts{A) 


X 4 Wxqstdy^i^^) (t) 


(E|(^))-i^td/i^-(.,(t) 


(55) 


for each Borel set A. Here we used the well known property on the distribution of 
the points in an inhomogeneous random process given that the number of points is 
k, see, e.g. [53]. Note that yg does not depend on w and it is the distribution of 
the size-biased random variable 


= (l + /3)i/2y3-i/4 
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Thus, using (l54l) and the fact wx 
P(do,i = k\ix = i) = 


^ is independent of ix, for any fc = 0,1,... 
(IIc;o,i=fcIix=i|^x) 


= P(Po(wxg*E|®) = k)= P(A = k). (56) 

Also, by (1551) for any fc = 0,1,... and {ti,... ,tk) G 


P(W^i,i < ti, • ■ •, hFi.fc < hlix = i, do,i = k) 

^ EE ij.Wi,i<ti,...,Wi^k<tk^ix=i^da,i=k\wx) 
E =^zlldo,i—fc 


np(ff <i,)- 

t=i 


(57) 


(Here, to formally establish the connection of ((5^ and (1551) with the conditional 
expectations in (1551) and (1571) respectively, construct the probability space for 
as a countable product of spaces generated by wx and a sequence Z of other 
independent random variables, see, e.g.. Example 3.6.8 of [24]. Then define a 
measurable function 0, so that g{'w) = (j){w,Z) is the random variable of interest 
when we consider wx = w fixed. Here the random variables of interest are of 
the form ldo,i=Ax=i and i<ti, •.., Iw'i,fc<ifcIdo,i=fcIix=i- Finally, use Radon- 
Nikodym’s theorem, see Example 5.1.5 of [24].') 

The trees T and can both be considered as random rooted plane trees, i.e., 
rooted trees where the offspring of each node is ordered. Below we consider the set 
of generation r vertices of a plane tree T as ordered (e.g., by the discovery times of 
the breadth-first search in T). For a plane tree T, we let Br{T) denote the plane 
tree induced on the generations 0,... ,r of T. For two rooted plane trees Ti,T 2 we 
write Ti = T 2 if there is an isomorphism from Ti to T 2 preserving the root and the 
ordering for each vertex. 

Let S{r) be the sequence of vertices at distance r from the root in For 
r = 1, 2,..., let j if r is odd and i if r is even. To complete the proof, 
we show the following claim by induction on r: for any finite rooted plane tree 
T of radius at most r, and any z = 1,2, P(i?r(A’„) = T\ix = i) = F(BriT) = 
T); furthermore given ix = i and BriX^) = T, the types of vertices in S{r) are 
conditionally independent and distributed as 

(58) 

Indeed, we have already shown the claim for r = 1. Suppose r > 2. Let T be 
a rooted plane tree of radius at most r. If the radius of T is less than r, we 
are done. So assume there are m > 1 and q > 1 vertices in generations r and 
r — 1 of T respectively. Let Tr-i = Br-i{T). Denote by Xaj = {Ia,j,Wa,j) the 
type and by daj the number of offspring of the j-th vertex in S{a). Of course, 
for any j = l,...,q P(/r-ij = -ir-i|Hr-i(A’„) = Tr-i,ix = i) = 1. By ([511) 
and induction, the conditional distribution of {dr- 1 , 1 , ■ ■ ■ ,dr-i,q) given ix = i and 
Br-i{X,^) = Tr-i, is the same as the distribution of a vector of q iid random 
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variables {rji,... ,r]g) 


Tj, ^ Po ^ Po ((!(*’'-)) ~ Dl - 1. 

Let Uj be the number of children in T of the j-th vertex in generation r — 1 of T. 
We have 


P(d 7 '_i^l — yi, • ■ • , dr—l^q — Vql'i'X — L — ^r—l) 




=Tr-i 111=1 


=yj 


(A:’^)=Tr_i 


(59) 

i=i 


By (l59l) and induction 

P(Br(df«;) = T\ix =i) = F{Br-i{X^) = Tr-i\ix =i)x 

P(d7'_in = yi; • ■ • ; dr—lj = yq\^X — L -^r — l(d^K) — Tr — l) 

= P(S,_i(r) = r,_i) f[ F{Dl_^ = y,) = F[B,[r) = T) 
i=i 


Using (1551) . we see that given ix = i, B^-iiX^) = Tf-i, X^-ij = (ir-i,Wr-ij) 
and dr-ij = y^, j = 1,..., g, the types of offspring of each vertices in S{r — 1) are 

independent and distributed as (U, ). By the conditional independence of 

the offspring of S'(r — 1) we get, integrating over Wr-i i, • ■ •, Wr-i g, similarly as in 
dSZl), that for any ti,... ,tm G R 


FiWr,l < tl, . . . , Wr,m < tm\lX = i, T, = T) = [] < t,). 

i=i 


The measure theoretic details concerning conditional expectations here may be 
filled using a similar argument as above. □ 


We will need the following fact based on ideas from m- The statement and 
the proof would also hold more generally when k is an “integrable kernel family”, 
see m- Recall that SAS' denotes the symmetric difference between sets S and 
S'. We will apply the lemma with small t. 

Lemma A.7 Fix a rooted connected graph H, a non-negative integer r and t > 0. 
Let G{n, k), A and A be as in Lemma \A.4\ Let G'.^ he another random graph and 
X a random subset ofV{G'^). 

Write Av = U(G(n, k))AU(G'„), Ae = E{G{n, k))AE{G'J, Aa = AAA'. 
There is S > 0 such that */P(|Av| + + |Ayi| < Sn) —>• 1, then 

P(|pr(G'(n), iJ, A') - ch\ > t) -)■ 0, 


where ch = TPr{X^^A,H). 
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Proof We may assume t < 0.1. Write a = /i(A) and fix e € (0, ^acnt). 

Call a vertex v of Gn = G(n, k) bad if it is in Ay U or it is incident to an 
edge in Ae- Let B the set of all bad vertices. Then \B\ < |Ay| + 2|A£;| + |A^|. 

Lemma 9.3 of [H] states that for each e' > 0 we can find S' > 0 such that for 
each set Q of size at most S'n the number of vertices in G„ at distance at most 1 
from Q is at most e'n with probability 1 — o(l). 

By r applications of this lemma if follows that if <5 is sufficiently small and 
|Ay| + |A_e| + |A^| < Sn then the set S of vertices v G V(G(n,K)) at distance 
at most r in G(n, k) from B satisfies < en with probability 1 — o(l). We can 
assume that S < e. 

Call a realisation of (G„, G^) bad if 

• [S'! > en; 

• or \pr{Gn, H, A) - ch\ > e; 

• or ||A| — an| > en; 

Then {Gn,G'„) is good whp (with probability 1 — o(l)): the first event holds 
whp by the above argument, the second event holds whp by Lemma lA.4[ finally, the 
third event holds whp by the law of large numbers, since |A| ^ Binom{n, p{A)). 

We will write as a shorthand x = 6±c for x G [6—c, b + c]. We write x±b = c±d 
for [x — b,x + b] C [c — d, c + d]. 

Suppose (G„,G(j) is good. Then for each vertex v of G„ such that v ^ S we 
have Br{Gn,v) = Br[G'^,v). Write A(G, i) = J2veA'^BAG,v)siH- Then 


N{Gn,A) 

Pr ■= -=- 

|A| 


CH±e 


and 


N(Gm A) = acjjn (^li—')= acnnil it). 
V a/ V CH/ 


Further, since (Gn,G'„) is good, \N{Gn,A) — N{G'.^,A')\ < [S'! < en; ||A| — |A'|| < 
Sn < en and ||A| — an| < en. Thus we have 


lA'l |A|±«ii 

= ASA)(i±^!L_)(i±5^) 

|A| W(G„,/1)” \A\' 

= Pr{Gn,H, A){l±t) 


Here we used simple inequalities (l±e) ^ = (l±2e), (l±ei)(l±e 2 ) = l±2(ei + e 2 ) 
that hold for any ei, e 2 , e G 0.5 and the facts that 


en 2e 

-^ < - < 0.5t < 0.5; 

NiGniA) acH 


0.5( < 0.5. 

|H| a 
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Thus we have 


P(|Pj.(G'^, H, a') — ch\ > t) < P((G„, G^) is bad) —^ 0. 


□ 

Recall that for a random bipartite graph H', p^\h, H') = E ,v*)'^h\H') 

where v* €„ Vi{H'). 

Lemma A .8 Suppose /3, n-i, n 2 and Hn are as in Theorem \3 . lY iii} . 

Then for any rooted tree H, pi^\Hn, H) A Pr{XK,Ai, H), where X^, is the 
branching process corresponding to (S', p), n = Kp given in h52\) and \5S\) . and <Tk,Ai 
is Xk, conditioned on the event Ai that the type X of T:ooi{Xif) satisfies ix = 1- 

Proof Let N = [(1 + /3)niJ. We will define a coupling between Hn and Hf ^ 
G{N, k), as follows. Construct sequences of independent random variables indexed 
by the elements of the fixed countable vertex sets V^, V^: 

{e«:uer}; v€V\ z = l,2, (60) 

{Uu,v, u € V^,v e V^}, Uu,v ~ Uniform{[0, 1]), u G V\ u G 
and an independent 


n'l ~ Binom{N, qi)-, n 2 = N — n). (61) 

Here, as before, qi = > 0 and 92 = 1 — 9 i > 0. 

We construct Hf as follows. We let VfH’^) consist of the first n' elements of V®. 
For i = 1,2 and each u G VfHf) we assign the point Xu = {iu,Wu) = (lAu^) G S. 
We define for u G V^, u G 


Pu.V ■ 


k{^XuiXv) fufv 


N 2N ’ 

and let E' = EfHf) consist of those pairs uv, u v where 

Uu,V S) Pu,V •“ ‘^Pu,v ~ (Pu,v) ■ 

Clearly, taking an independent random permutation of the vertices of Hf gives a 
copy of G{N, k). 

Similarly, we construct the graph Hn by letting VfHn) consist of the first Ui 
vertices of V®, i = 1,2, defining for u G V^, u G 


Pu,v * — 


;r{l)c(2) 


and adding {u, v} to E = E{Hn) if and only if Uu,v < Pu,v 
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We let A = Vi(H„) and A' = Vi{H’^). Denote Ay = V{Hn)AV{H!^) and 
Ae = E{Hn)AE{H'^). Since n' = Binom{N,qi) for i = 1,2 we have, using 
Chebyshev’s inequality 

E K - Nq,\ < nf ^ + iVP(|n' - Nq,\ > = O (nf 

So for 1 = 1,2 


E |n' - nj| < E |n' - Nq^} + \Nqi - m] = o{N) 


and E |Ay| = o{N). Let A]j be the set of edges incident to vertices in Ay combined 
over both graphs iL„, H^. Let uq and vq be the first vertex of and respectively. 
Using our coupling we see that there is a constant c such that for all n large enough 

EdAslln'i) < |Ay|AE(p„„,„„ +p"o.<;o) ^ 

Here we used the assumption (iii) that E^(®) < oo, i = 1,2. Hence E|A]j| = 
0(E|Ay|) =o(A). 

By the assumption nijnx —>■ /3 of Theorem 13.II there is a constant d such that 
for all M € V^, u G and all n large enough 


-p"„| < (nin2) 


< cN 


-1 


Finally, let A\ = Ae\ A^, denote by E, the conditional expectation given (l60l) . 
m. and let for i = 1,2 consist of the first N elements of V*. We have 


E|A||<EE,(|A||)<E ^ ^ \plk-Pi.k\=oiN). 

We have shown that Eln and H'^ are such that E |Ay| = o(iV), E \Ae\ = o{N) and 
E |HAH'| = E \n'i — ni| = o{N). Now Lemma FA. 71 completes the proof. □ 


Proof of Theorem I3.1L case (ii) The claim follows by Lemma TA.SI and Propo¬ 
sition IA.6I □ 


A.3 The configuration model 

We present a proof based on coupling of the breadth-first exploration of Br{Hn, rii), 
... Br{Hn,vl.) with the first r generations of k independent branching processes 
r(Di,D 2 ), see also [HI[26]. 


Proof of Theorem I3.1l fivl Let T = T(Di, £> 2 ). It suffices to show that for arbi¬ 
trary fixed positive integers r, k and any sequence Ci,... ,Ck of rooted connected 
graphs of radius at most r 




,Ck)-l[priT,Cj) 

i=i 


(62) 
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Applying this with k = 1 and k = 2 and using the same standard second moment 
argument as in the proof of Theorem 13.If al shows that pi^\Hn,Ci) A Pr(T, Ci). 
Finally, replacing r by 2r yields ([2|). In the rest of the proof we show (15^ . 

Let Yr denote the number of nodes in the first r generations of T{Di, £> 2 )- Fix 
arbitrary e > 0 and a positive integer T such that P(Fr > T/k) < e/k. Such a T 
exists, since E£>i < 00 and P(£>i = 00 ) = P(£>* = 00 ) = 0 for z = 1, 2. 

We will assume that for each n the degree sequences di = di(n), ^2 = d 2 {n) are 
indexed by V^, respectively. Recall that in the configuration bipartite graph 
Hn each vertex v in part i is assigned a list Ey of di^y half-edges. We identify the 
j-th half edge of Ey with the pair {v,j) and we think of the set Ey as ordered by 
the second coordinate of each element. There are in total N = ^ij = d2j 
half-edges assigned to vertices from each of the parts V^, Recall also that rnj 
denotes the number of elements in di that equal j. 

Using the assumption of the theorem, we can define e„ —?► 0 and —>■ 00 such 

that the event that for all n = 1, 2,..., z = 1, 2 and j = 1,..., a;„ 


^-P(A=j) 


-EA 

m 


rii 


holds whp. By considering the distribution of {di,d 2 ) conditioned on we may 
assume without loss of generality that P(i3„) = Iforeachn. Since rzi, 77 , 2 , EfV —>■ 00 , 
we can also assume that rzi, n 2 > 0 and P(A^ > 0) = 1. 

For each n we can construct the following set of 2T + k random elements that 
are conditionally independent given di,d 2 '- 

vi,V 2 , ■ ■ ■ ,Vk uniformly random elements from (63) 

hii, hi 2 , ..., hiT uniformly random half-edges from Ey; (64) 

^ 21 , ^ 22 , • ■ •, ^ 2 T uniformly random half-edges from U„gy 2 Ey. (65) 

We denote the degree of Vj by dj , and the degree of the endpoint of a half edge hij 
by dij. The necessary event implies that for each fixed j we have 


\F{d,=j)-F{D,=j)\ 


E^-F{Di=j) 

ni 


for all n such that tOn > j- So di -A Di. Now dii ~ d* and since di -A Di 

and EDi < 00 by (iv) we get dii -A D^. Similarly, Ai -A £> 2 . If a sequence 
Xn of discrete random variables converges in distribution to a random variable U, 
then dTvi^n, Y) —>■ 0. For a pair of random variables {X, Y) with dTV'(W, Y) = a, 
we can always define a coupled copy {X', Y') such that X' ^ X, Y' ^ Y and 
P(A = F) > I —a, see, e.g., [28]. By the union bound and conditional independence 
(given di,d 2 ) of Dj, vtj, j = 1,..., T, z = 1, 2, it follows that there is a positive 
sequence i5„ —>■ 0 such for each n given di and d 2 we can couple {d^}, {di j} and 
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{d 2 ,j}, 3 = 1, • ■ •, T' with random variables 


di,d2,... ,dfc ~ (66) 

'^11, ^12, • ■ ■ : ^1, (67) 

'^21, (^22, • ■ ■ ,'^2T ~ -D 2 , (68) 

so that the variables in (l66l) — ((68l) are conditionally independent (given di and ^ 2 ) 
and the event A that dj = dj, for all j = 1,..., fc and = dij for all i = 1,2, 


j = 1,.. .T holds with probability at least 1 — (5„ (conditioned on di and ^ 2 )- We 
combine the distributions dSi-dMl), defined for each realization of {di,d 2 ), and 
view them as a single (unconditional) distribution. Note that all 2T + k elements 
remain independent unconditionally, and the distribution of each particular element 
remains as in (EH)-®. 

We may further extend the sequences (1571) and (1551) by defining independent 
random variables dtj ~ D* for each i = 1,2 and each j = T + 1,T + 2,, 
independent of the previously defined variables. 

Now use {di ,..., dk}, = 1, 2, ...},* = 1, 2 to define the first r generations 

of k independent copies of T as follows. Generate the children of each node in a 
breadth-first search manner: let di be the number of children of the root of the 
first tree. Next, if the root has s children, for the j-th child add d 2 j — 1 children. 
Similarly for nodes in an even generation use random variables dij — 1, taking the 
next unused random variable from the sequence for each new subtree, and so on. 
Once the degrees of all generations up to r — 1 are determined, start a new rooted 
tree using the next unused random variable di as the degree of its root, and so 
on. Continue until the degrees of all generations up to r — 1 of the fc-th tree are 
determined and denote the resulting forest of k ordered rooted trees by F. 

In parallel we perform the following BFS-exploration of Hn (truncated at dis¬ 
tance r) that constructs a forest (an ordered sequence) F oi k rooted trees. We 
will make T steps, at each step updating F (initially empty) and Q (an initially 
empty sequence, representing the queue data structure where new elements are al¬ 
ways appended to the end, and the pop operation returns the first element of Q 
and deletes it). To keep the notation simpler we will not index F and Q by n or 
the step number j = 1,... ,T. If after some step j we have |fo(i^)| -k IQI > T, we 
say that step 3 overflows and stop the exploration (no more matches are revealed 
until step T). We also stop the exploration if we complete it or we fail at some step 
J, this is defined below. 

At step I: let u = fii be the root of the first tree in F. Step 1 never fails. Add 
di half edges of vi to Q keeping their order. 

Let y be a positive integer, y < T. Suppose the steps I,..., y did not fail or 
overflow, and the forest F after step y has ky trees and Ny vertices. We have several 
possibilities: 

• The queue Q is empty and ky = k: the construction is complete. F and Q 
remain frozen for all steps y -|- 1,..., T. 
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• The queue Q is empty and ky < k. Consider the vertex v = v^y+i- If v is 
already in declare that step y + 1 fails. Otherwise, start a new tree with 
root u in F and add the half-edges Ey to the end of Q. 

• Q is non-empty. Pop the first half-edge (u,j') from Q. Suppose u G V^. 

Suppose also that we used exactly s half-edges from the sequence hj i, hi 2 ■ • ■ j 
in the previous steps. Pair (u,j') with hi gj^i = {v,j''). If v is already in F, 
we say that step y -I- 1 fails. Otherwise, we add ?; as a child of u in F. If v 
is at distance < r from the root of its tree component, we add the remaining 
di y — 1 half-edges F„ \ fo the end of Q. 

This process reveals some pairs in the random matching that yields Hn- After 
step T we complete the construction of Hn by picking an independent random 
matching between those half edges in U^gyiF.;, and Uy^Y 2 Ey which have not yet 
been paired by the exploration process. 

Let Wj be the event that step j fails. Let Oj be the event that step j overflows. 
We can assume that after the first fail or an overflow at step j, Wt and Ot do not 
occur, and F and Q remain unchanged for all subsequent steps t = j + l,j+2 ,..., T. 

For two ordered sequences of rooted graphs L = {Fi,..., Lt}, L' = {L [,..., L^} 
we write L = L' to denote the fact there is an isomorphism that maps root(Fj) to 
root(Fj) for y = I,..., t. 

Let K be the event that no step j = 1,... ,T fails or overflows and the event A 
holds. This implies that the process constructing F must have finished after some 
step j <T (otherwise after step T we have |F(F)| >T).K also implies that F is 
an induced subgraph of K also implies that F has no cycles and no repeated 
edges. 

So on K we have F = F' := {Br{HmVi),... ,Br{Hn,Vk)}- Since the BFS 
procedure builds the forests F and F in the same order and the degrees of nodes 
are coupled to agree, the event K implies that F = F'. 

Now if A holds and F overflows at some step j then our coupling implies that 
the final forest |F(F)| > T. So 

P(F ^ F') > I - P(A) - P(|F(F)| > T) - F{UjWj). 

We have shown above that P(A) —>■ 0. By our choice of T and the union bound, 
P(|F(F)| >T)< kek~^ = e. Now let Tj be the cr-algebra generated by (di, ^ 2 ) and 
the first j steps (i.e., the random variables {%}, {uij} and {v 2 j} that are revealed 
in steps 1, ..., j). Let us bound P(IFj+i|Fy) := E (Iwq+i l-Py). On the blocks of Fj 
where Wi or Oi occurs for some I < j we have that = 0 by definition. We 

may consider those blocks of Fj where Wi,... ,Wj and Oi,... ,Oj do not occur. 
On those blocks of Fj where at step y + 1 we add a new root vertex, Wy+i occurs 
with probability at most 

Nj ^ T 

ni ~ m 

On those blocks of F where at step y -|- 1 we consider a new half-edge from Q, 
Wj+i occurs with probability at most 2T/N, since the total number of half-edges 
assigned to vertices in F is exactly 2|F(F)| -|- \Q\ < 2T. 
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Thus, using P(-B„) = 1, there are absolute constants Nq, c for all n > Nq such 
that 


< — + 


2T 


cT 

ni ■ 





3=0 

Therefore P(F = F') > 1 — e — o(l). Since the proof holds for arbitrary e > 0, we 


conclude that F{F = F) ^ 1 as n ^ oo. This completes the proof of (15^ . 


□ 


Proof of Remark 13.21 The remark is shown as part of the proof of each of the 
cases, see Lemma IATI Lemma IATbI (|66 )) - (|67jl and the argument in the proof. □ 

Proof of Remark 13.31 Let ni —>• oo be a sequence of positive integers. Let 
/? = EL>j(EL> 2 )~^ and 712 = \_(ini\. Let d' = d[{n) consist of Ui independent copies 
of D', so that d[{n), ^ 2 ( 71 ) are independent. Write S'- = To make the sums of 

both sequences equal, define Zi = (S' — Sf)+ and let di = di(n) = {d'^, ... ,d'^., Zi}. 

Fix any e > 0. By our choice of 712 , |E S 2 — E S(| = 0 ( 711 ). By the weak law of 
large numbers ^ A Ed?', therefore for all n large enough 


P(|S( -S^l > SeTii) < P(|Si -ES(| > cTii) +P(|S^ -ES^I > e77i) + o(l) ^ 0. 
So Zi{n, + l)-i < |S( - + l)-i 4 0 and 


EZV d^,3 n. S' Z, p 

— -;— =-7— H-7 


Tij + 1 ni + lm Tij + 1 


Similarly for 7 = 1,2 and any fc = 0,1,...: 













